Latent label mining for group activity recognition in basketball videos

نویسندگان

چکیده

Motion information has been widely exploited for group activity recognition in sports video. However, order to model and extract the various motion between adjacent frames, existing algorithms only use coarse video-level labels as supervision cues. This may lead ambiguity of extracted features omission changing rules patterns that are also important video recognition. In this paper, a latent label mining strategy basketball videos is proposed. The authors' novel allows them obtain set marking different frames an unsupervised way, build frame-level representations with two separate levels signal. Firstly, digged using hierarchical clustering technique. generated then taken signal train deep CNN extraction. Lastly, fed into LSTM network spatio-temporal representation Experimental results on public NCAA dataset demonstrate proposed algorithm achieves state-of-the-art performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Human Activity Recognition in Videos

Our project deals with the problem of classifying real-world videos by human activity. Such videos usually have a large variation in background and camera motion. This makes the performance of models using low-level appearance and motion features unsatisfactory, particularly in the case of video classes sharing similar objects and background (e.g. ”snatch” and ”clean-jerk” weightlifting actions...

متن کامل

A New Network-Based Algorithm for Human Group Activity Recognition in Videos

In this paper, a new network-based (NB) algorithm is proposed for human group activity recognition in videos. The proposed NB algorithm introduces three different networks for modeling the correlation among people as well as the correlation between people and the surrounding scene. With the proposed network models, human group activities can be modeled as the package transmission process in the...

متن کامل

Activity Recognition in Egocentric Life-Logging Videos

With the increasing availability of wearable cameras, research on first-person view videos (egocentric videos) has received much attention recently. While some effort has been devoted to collecting various egocentric video datasets, there has not been a focused effort in assembling one that could capture the diversity and complexity of activities related to life-logging, which is expected to be...

متن کامل

HiRF: Hierarchical Random Field for Collective Activity Recognition in Videos

This paper addresses the problem of recognizing and localizing coherent activities of a group of people, called collective activities, in video. Related work has argued the benefits of capturing long-range and higher-order dependencies among video features for robust recognition. To this end, we formulate a new deep model, called Hierarchical Random Field (HiRF). HiRF models only hierarchical d...

متن کامل

Feature Learning for Interaction Activity Recognition in RGBD Videos

This paper proposes a human activity recognition method which is based on features learned from 3D video data without incorporating domain knowledge. The experiments on data collected by RGBD cameras produce results outperforming other techniques. Our feature encoding method follows the bag-of-visual-word model, then we use a SVM classifier to recognise the activities. We do not use skeleton or...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Iet Image Processing

سال: 2021

ISSN: ['1751-9659', '1751-9667']

DOI: https://doi.org/10.1049/ipr2.12265